Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 24
Filtrar
1.
Nat Genet ; 55(3): 389-398, 2023 03.
Artículo en Inglés | MEDLINE | ID: mdl-36823319

RESUMEN

Interacting proteins tend to have similar functions, influencing the same organismal traits. Interaction networks can be used to expand the list of candidate trait-associated genes from genome-wide association studies. Here, we performed network-based expansion of trait-associated genes for 1,002 human traits showing that this recovers known disease genes or drug targets. The similarity of network expansion scores identifies groups of traits likely to share an underlying genetic and biological process. We identified 73 pleiotropic gene modules linked to multiple traits, enriched in genes involved in processes such as protein ubiquitination and RNA processing. In contrast to gene deletion studies, pleiotropy as defined here captures specifically multicellular-related processes. We show examples of modules linked to human diseases enriched in genes with known pathogenic variants that can be used to map targets of approved drugs for repurposing. Finally, we illustrate the use of network expansion scores to study genes at inflammatory bowel disease genome-wide association study loci, and implicate inflammatory bowel disease-relevant genes with strong functional and genetic support.


Asunto(s)
Biología Celular , Células , Enfermedad , Estudios de Asociación Genética , Pleiotropía Genética , Estudios de Asociación Genética/métodos , Humanos , Ubiquitinación/genética , Procesamiento Postranscripcional del ARN/genética , Células/metabolismo , Células/patología , Reposicionamiento de Medicamentos/métodos , Reposicionamiento de Medicamentos/tendencias , Enfermedad/genética , Enfermedades Inflamatorias del Intestino/genética , Enfermedades Inflamatorias del Intestino/patología , Estudio de Asociación del Genoma Completo , Fenotipo , Enfermedades Autoinmunes/genética , Enfermedades Autoinmunes/patología
2.
Nucleic Acids Res ; 51(D1): D1353-D1359, 2023 Jan 06.
Artículo en Inglés | MEDLINE | ID: mdl-36399499

RESUMEN

The Open Targets Platform (https://platform.opentargets.org/) is an open source resource to systematically assist drug target identification and prioritisation using publicly available data. Since our last update, we have reimagined, redesigned, and rebuilt the Platform in order to streamline data integration and harmonisation, expand the ways in which users can explore the data, and improve the user experience. The gene-disease causal evidence has been enhanced and expanded to better capture disease causality across rare, common, and somatic diseases. For target and drug annotations, we have incorporated new features that help assess target safety and tractability, including genetic constraint, PROTACtability assessments, and AlphaFold structure predictions. We have also introduced new machine learning applications for knowledge extraction from the published literature, clinical trial information, and drug labels. The new technologies and frameworks introduced since the last update will ease the introduction of new features and the creation of separate instances of the Platform adapted to user requirements. Our new Community forum, expanded training materials, and outreach programme support our users in a range of use cases.

3.
Sci Rep ; 12(1): 1131, 2022 01 21.
Artículo en Inglés | MEDLINE | ID: mdl-35064169

RESUMEN

Haematological traits are linked to cardiovascular, metabolic, infectious and immune disorders, as well as cancer. Here, we examine the role of genetic variation in shaping haematological traits in two isolated Mediterranean populations. Using whole-genome sequencing data at 22× depth for 1457 individuals from Crete (MANOLIS) and 1617 from the Pomak villages in Greece, we carry out a genome-wide association scan for haematological traits using linear mixed models. We discover novel associations (p < 5 × 10-9) of five rare non-coding variants with alleles conferring effects of 1.44-2.63 units of standard deviation on red and white blood cell count, platelet and red cell distribution width. Moreover, 10.0% of individuals in the Pomak population and 6.8% in MANOLIS carry a pathogenic mutation in the Haemoglobin Subunit Beta (HBB) gene. The mutational spectrum is highly diverse (10 different mutations). The most frequent mutation in MANOLIS is the common Mediterranean variant IVS-I-110 (G>A) (rs35004220). In the Pomak population, c.364C>A ("HbO-Arab", rs33946267) is most frequent (4.4% allele frequency). We demonstrate effects on haematological and other traits, including bilirubin, cholesterol, and, in MANOLIS, height and gestation age. We find less severe effects on red blood cell traits for HbS, HbO, and IVS-I-6 (T>C) compared to other b+ mutations. Overall, we uncover allelic diversity of HBB in Greek isolated populations and find an important role for additional rare variants outside of HBB.


Asunto(s)
Índices de Eritrocitos/genética , Genética de Población , Globinas beta/genética , Estudios de Cohortes , Análisis Mutacional de ADN , Recuento de Eritrocitos , Frecuencia de los Genes , Variación Genética , Estudio de Asociación del Genoma Completo , Grecia , Humanos , Recuento de Leucocitos , Mutación , Pruebas de Función Plaquetaria , Secuenciación Completa del Genoma
4.
Sci Rep ; 11(1): 3285, 2021 02 08.
Artículo en Inglés | MEDLINE | ID: mdl-33558538

RESUMEN

Podoconiosis, a debilitating lymphoedema of the leg, results from barefoot exposure to volcanic clay soil in genetically susceptible individuals. A previous genome-wide association study (GWAS) conducted in the Wolaita ethnic group from Ethiopia showed association between single nucleotide polymorphisms (SNPs) in the HLA class II region and podoconiosis. We aimed to conduct a second GWAS in a new sample (N = 1892) collected from the Wolaita and two other Ethiopian populations, the Amhara and the Oromo, also affected by podoconiosis. Fourteen SNPs in the HLA class II region showed significant genome-wide association (P < 5.0 × 10-8) with podoconiosis. The lead SNP was rs9270911 (P = 5.51 × 10-10; OR 1.53; 95% CI 1.34-1.74), located near HLA-DRB1. Inclusion of data from the first GWAS (combined N = 2289) identified 47 SNPs in the class II HLA region that were significantly associated with podoconiosis (lead SNP also rs9270911 (P = 2.25 × 10-12). No new loci outside of the HLA class II region were identified in this more highly-powered second GWAS. Our findings confirm the HLA class II association with podoconiosis suggesting HLA-mediated abnormal induction and regulation of immune responses may have a direct role in its pathogenesis.


Asunto(s)
Elefantiasis , Etnicidad/genética , Predisposición Genética a la Enfermedad/etnología , Cadenas HLA-DRB1/genética , Polimorfismo de Nucleótido Simple , Elefantiasis/etnología , Elefantiasis/genética , Etiopía/etnología , Femenino , Estudio de Asociación del Genoma Completo , Humanos , Masculino
5.
Nucleic Acids Res ; 49(D1): D1302-D1310, 2021 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-33196847

RESUMEN

The Open Targets Platform (https://www.targetvalidation.org/) provides users with a queryable knowledgebase and user interface to aid systematic target identification and prioritisation for drug discovery based upon underlying evidence. It is publicly available and the underlying code is open source. Since our last update two years ago, we have had 10 releases to maintain and continuously improve evidence for target-disease relationships from 20 different data sources. In addition, we have integrated new evidence from key datasets, including prioritised targets identified from genome-wide CRISPR knockout screens in 300 cancer models (Project Score), and GWAS/UK BioBank statistical genetic analysis evidence from the Open Targets Genetics Portal. We have evolved our evidence scoring framework to improve target identification. To aid the prioritisation of targets and inform on the potential impact of modulating a given target, we have added evaluation of post-marketing adverse drug reactions and new curated information on target tractability and safety. We have also developed the user interface and backend technologies to improve performance and usability. In this article, we describe the latest enhancements to the Platform, to address the fundamental challenge that developing effective and safe drugs is difficult and expensive.


Asunto(s)
Antineoplásicos/uso terapéutico , Drogas en Investigación/uso terapéutico , Bases del Conocimiento , Terapia Molecular Dirigida/métodos , Neoplasias/tratamiento farmacológico , Programas Informáticos , Antineoplásicos/química , Bases de Datos Factuales , Conjuntos de Datos como Asunto , Descubrimiento de Drogas/métodos , Drogas en Investigación/química , Humanos , Internet , Neoplasias/clasificación , Neoplasias/genética , Neoplasias/patología
6.
Nucleic Acids Res ; 49(D1): D1311-D1320, 2021 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-33045747

RESUMEN

Open Targets Genetics (https://genetics.opentargets.org) is an open-access integrative resource that aggregates human GWAS and functional genomics data including gene expression, protein abundance, chromatin interaction and conformation data from a wide range of cell types and tissues to make robust connections between GWAS-associated loci, variants and likely causal genes. This enables systematic identification and prioritisation of likely causal variants and genes across all published trait-associated loci. In this paper, we describe the public resources we aggregate, the technology and analyses we use, and the functionality that the portal offers. Open Targets Genetics can be searched by variant, gene or study/phenotype. It offers tools that enable users to prioritise causal variants and genes at disease-associated loci and access systematic cross-disease and disease-molecular trait colocalization analysis across 92 cell types and tissues including the eQTL Catalogue. Data visualizations such as Manhattan-like plots, regional plots, credible sets overlap between studies and PheWAS plots enable users to explore GWAS signals in depth. The integrated data is made available through the web portal, for bulk download and via a GraphQL API, and the software is open source. Applications of this integrated data include identification of novel targets for drug discovery and drug repurposing.


Asunto(s)
Bases de Datos Genéticas , Genoma Humano , Enfermedades Inflamatorias del Intestino/genética , Terapia Molecular Dirigida/métodos , Sitios de Carácter Cuantitativo , Programas Informáticos , Cromatina/química , Cromatina/metabolismo , Conjuntos de Datos como Asunto , Descubrimiento de Drogas/métodos , Reposicionamiento de Medicamentos/métodos , Estudio de Asociación del Genoma Completo , Genotipo , Humanos , Enfermedades Inflamatorias del Intestino/tratamiento farmacológico , Enfermedades Inflamatorias del Intestino/metabolismo , Enfermedades Inflamatorias del Intestino/patología , Internet , Fenotipo , Carácter Cuantitativo Heredable
7.
Nat Commun ; 11(1): 6336, 2020 12 10.
Artículo en Inglés | MEDLINE | ID: mdl-33303764

RESUMEN

The human proteome is a crucial intermediate between complex diseases and their genetic and environmental components, and an important source of drug development targets and biomarkers. Here, we comprehensively assess the genetic architecture of 257 circulating protein biomarkers of cardiometabolic relevance through high-depth (22.5×) whole-genome sequencing (WGS) in 1328 individuals. We discover 131 independent sequence variant associations (P < 7.45 × 10-11) across the allele frequency spectrum, all of which replicate in an independent cohort (n = 1605, 18.4x WGS). We identify for the first time replicating evidence for rare-variant cis-acting protein quantitative trait loci for five genes, involving both coding and noncoding variation. We construct and validate polygenic scores that explain up to 45% of protein level variation. We find causal links between protein levels and disease risk, identifying high-value biomarkers and drug development targets.


Asunto(s)
Miocardio/metabolismo , Proteoma/genética , Secuenciación Completa del Genoma , Regulación de la Expresión Génica , Redes Reguladoras de Genes , Predisposición Genética a la Enfermedad , Humanos , Herencia Multifactorial/genética , Proteoma/metabolismo , Sitios de Carácter Cuantitativo/genética , Factores de Riesgo
8.
Genet Epidemiol ; 44(1): 79-89, 2020 01.
Artículo en Inglés | MEDLINE | ID: mdl-31520489

RESUMEN

Copy number variants (CNVs) play an important role in a number of human diseases, but the accurate calling of CNVs remains challenging. Most current approaches to CNV detection use raw read alignments, which are computationally intensive to process. We use a regression tree-based approach to call germline CNVs from whole-genome sequencing (WGS, >18x) variant call sets in 6,898 samples across four European cohorts, and describe a rich large variation landscape comprising 1,320 CNVs. Eighty-one percent of detected events have been previously reported in the Database of Genomic Variants. Twenty-three percent of high-quality deletions affect entire genes, and we recapitulate known events such as the GSTM1 and RHD gene deletions. We test for association between the detected deletions and 275 protein levels in 1,457 individuals to assess the potential clinical impact of the detected CNVs. We describe complex CNV patterns underlying an association with levels of the CCL3 protein (MAF = 0.15, p = 3.6x10-12 ) at the CCL3L3 locus, and a novel cis-association between a low-frequency NOMO1 deletion and NOMO1 protein levels (MAF = 0.02, p = 2.2x10-7 ). This study demonstrates that existing population-wide WGS call sets can be mined for germline CNVs with minimal computational overhead, delivering insight into a less well-studied, yet potentially impactful class of genetic variant.


Asunto(s)
Variaciones en el Número de Copia de ADN/genética , Genética de Población/métodos , Genoma Humano/genética , Quimiocina CCL3/genética , Eliminación de Gen , Estudio de Asociación del Genoma Completo , Genómica , Glutatión Transferasa/genética , Humanos , Proteína Nodal/genética , Proteínas Recombinantes de Fusión/genética , Secuenciación Completa del Genoma
9.
Nat Genet ; 51(2): 230-236, 2019 02.
Artículo en Inglés | MEDLINE | ID: mdl-30664745

RESUMEN

Osteoarthritis is the most common musculoskeletal disease and the leading cause of disability globally. Here, we performed a genome-wide association study for osteoarthritis (77,052 cases and 378,169 controls), analyzing four phenotypes: knee osteoarthritis, hip osteoarthritis, knee and/or hip osteoarthritis, and any osteoarthritis. We discovered 64 signals, 52 of them novel, more than doubling the number of established disease loci. Six signals fine-mapped to a single variant. We identified putative effector genes by integrating expression quantitative trait loci (eQTL) colocalization, fine-mapping, and human rare-disease, animal-model, and osteoarthritis tissue expression data. We found enrichment for genes underlying monogenic forms of bone development diseases, and for the collagen formation and extracellular matrix organization biological pathways. Ten of the likely effector genes, including TGFB1 (transforming growth factor beta 1), FGF18 (fibroblast growth factor 18), CTSK (cathepsin K), and IL11 (interleukin 11), have therapeutics approved or in clinical trials, with mechanisms of action supportive of evaluation for efficacy in osteoarthritis.


Asunto(s)
Predisposición Genética a la Enfermedad/genética , Osteoartritis de la Cadera/genética , Adulto , Anciano , Bancos de Muestras Biológicas , Estudios de Casos y Controles , Femenino , Estudio de Asociación del Genoma Completo/métodos , Humanos , Masculino , Persona de Mediana Edad , Polimorfismo de Nucleótido Simple/genética , Sitios de Carácter Cuantitativo/genética , Reino Unido
10.
Nucleic Acids Res ; 47(D1): D1005-D1012, 2019 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-30445434

RESUMEN

The GWAS Catalog delivers a high-quality curated collection of all published genome-wide association studies enabling investigations to identify causal variants, understand disease mechanisms, and establish targets for novel therapies. The scope of the Catalog has also expanded to targeted and exome arrays with 1000 new associations added for these technologies. As of September 2018, the Catalog contains 5687 GWAS comprising 71673 variant-trait associations from 3567 publications. New content includes 284 full P-value summary statistics datasets for genome-wide and new targeted array studies, representing 6 × 109 individual variant-trait statistics. In the last 12 months, the Catalog's user interface was accessed by ∼90000 unique users who viewed >1 million pages. We have improved data access with the release of a new RESTful API to support high-throughput programmatic access, an improved web interface and a new summary statistics database. Summary statistics provision is supported by a new format proposed as a community standard for summary statistics data representation. This format was derived from our experience in standardizing heterogeneous submissions, mapping formats and in harmonizing content. Availability: https://www.ebi.ac.uk/gwas/.


Asunto(s)
Bases de Datos Genéticas , Estudio de Asociación del Genoma Completo , Enfermedad/genética , Variación Genética , Humanos , Análisis por Micromatrices , Publicaciones , Programas Informáticos , Interfaz Usuario-Computador
11.
Bioinformatics ; 35(15): 2555-2561, 2019 08 01.
Artículo en Inglés | MEDLINE | ID: mdl-30576415

RESUMEN

MOTIVATION: Very low-depth sequencing has been proposed as a cost-effective approach to capture low-frequency and rare variation in complex trait association studies. However, a full characterization of the genotype quality and association power for very low-depth sequencing designs is still lacking. RESULTS: We perform cohort-wide whole-genome sequencing (WGS) at low depth in 1239 individuals (990 at 1× depth and 249 at 4× depth) from an isolated population, and establish a robust pipeline for calling and imputing very low-depth WGS genotypes from standard bioinformatics tools. Using genotyping chip, whole-exome sequencing (75× depth) and high-depth (22×) WGS data in the same samples, we examine in detail the sensitivity of this approach, and show that imputed 1× WGS recapitulates 95.2% of variants found by imputed GWAS with an average minor allele concordance of 97% for common and low-frequency variants. In our study, 1× further allowed the discovery of 140 844 true low-frequency variants with 73% genotype concordance when compared to high-depth WGS data. Finally, using association results for 57 quantitative traits, we show that very low-depth WGS is an efficient alternative to imputed GWAS chip designs, allowing the discovery of up to twice as many true association signals than the classical imputed GWAS design. AVAILABILITY AND IMPLEMENTATION: The HELIC genotype and WGS datasets have been deposited to the European Genome-phenome Archive (https://www.ebi.ac.uk/ega/home): EGAD00010000518; EGAD00010000522; EGAD00010000610; EGAD00001001636, EGAD00001001637. The peakplotter software is available at https://github.com/wtsi-team144/peakplotter, the transformPhenotype app can be downloaded at https://github.com/wtsi-team144/transformPhenotype. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Secuenciación de Nucleótidos de Alto Rendimiento , Polimorfismo de Nucleótido Simple , Genotipo , Humanos , Herencia Multifactorial , Secuenciación Completa del Genoma
12.
Nat Commun ; 9(1): 5460, 2018 12 19.
Artículo en Inglés | MEDLINE | ID: mdl-30568165

RESUMEN

The original version of this Article contained an error in Fig. 2. In panel a, the two legend items "rare" and "common" were inadvertently swapped. This has been corrected in both the PDF and HTML versions of the Article.

13.
Nat Commun ; 9(1): 4674, 2018 11 07.
Artículo en Inglés | MEDLINE | ID: mdl-30405126

RESUMEN

The role of rare variants in complex traits remains uncharted. Here, we conduct deep whole genome sequencing of 1457 individuals from an isolated population, and test for rare variant burdens across six cardiometabolic traits. We identify a role for rare regulatory variation, which has hitherto been missed. We find evidence of rare variant burdens that are independent of established common variant signals (ADIPOQ and adiponectin, P = 4.2 × 10-8; APOC3 and triglyceride levels, P = 1.5 × 10-26), and identify replicating evidence for a burden associated with triglyceride levels in FAM189B (P = 2.2 × 10-8), indicating a role for this gene in lipid metabolism.


Asunto(s)
Alelos , Carácter Cuantitativo Heredable , Secuenciación Completa del Genoma , Estudios de Cohortes , Frecuencia de los Genes/genética , Variación Genética , Humanos
14.
Nat Genet ; 50(4): 549-558, 2018 04.
Artículo en Inglés | MEDLINE | ID: mdl-29559693

RESUMEN

Osteoarthritis is a common complex disease imposing a large public-health burden. Here, we performed a genome-wide association study for osteoarthritis, using data across 16.5 million variants from the UK Biobank resource. After performing replication and meta-analysis in up to 30,727 cases and 297,191 controls, we identified nine new osteoarthritis loci, in all of which the most likely causal variant was noncoding. For three loci, we detected association with biologically relevant radiographic endophenotypes, and in five signals we identified genes that were differentially expressed in degraded compared with intact articular cartilage from patients with osteoarthritis. We established causal effects on osteoarthritis for higher body mass index but not for triglyceride levels or genetic predisposition to type 2 diabetes.


Asunto(s)
Osteoartritis/genética , Bancos de Muestras Biológicas/estadística & datos numéricos , Mapeo Cromosómico , Femenino , Sitios Genéticos , Predisposición Genética a la Enfermedad , Variación Genética , Estudio de Asociación del Genoma Completo/métodos , Estudio de Asociación del Genoma Completo/estadística & datos numéricos , Humanos , Masculino , ARN no Traducido/genética , Reino Unido
15.
Ann Rheum Dis ; 77(4): 620-623, 2018 04.
Artículo en Inglés | MEDLINE | ID: mdl-29436472

RESUMEN

OBJECTIVES: Osteoarthritis (OA) is a complex disease, but its genetic aetiology remains poorly characterised. To identify novel susceptibility loci for OA, we carried out a genome-wide association study (GWAS) in individuals from the largest UK-based OA collections to date. METHODS: We carried out a discovery GWAS in 5414 OA individuals with knee and/or hip total joint replacement (TJR) and 9939 population-based controls. We followed-up prioritised variants in OA subjects from the interim release of the UK Biobank resource (up to 12 658 cases and 50 898 controls) and our lead finding in operated OA subjects from the full release of UK Biobank (17 894 cases and 89 470 controls). We investigated its functional implications in methylation, gene expression and proteomics data in primary chondrocytes from 12 pairs of intact and degraded cartilage samples from patients undergoing TJR. RESULTS: We detect a genome-wide significant association at rs10116772 with TJR (P=3.7×10-8; for allele A: OR (95% CI) 0.97 (0.96 to 0.98)), an intronic variant in GLIS3, which is expressed in cartilage. Variants in strong correlation with rs10116772 have been associated with elevated plasma glucose levels and diabetes. CONCLUSIONS: We identify a novel susceptibility locus for OA that has been previously implicated in diabetes and glycaemic traits.


Asunto(s)
Predisposición Genética a la Enfermedad/genética , Variación Genética/genética , Osteoartritis de la Cadera/genética , Osteoartritis de la Rodilla/genética , Factores de Transcripción/genética , Adulto , Artroplastia de Reemplazo de Cadera , Artroplastia de Reemplazo de Rodilla , Cartílago/metabolismo , Estudios de Casos y Controles , Condrocitos , Metilación de ADN , Proteínas de Unión al ADN , Femenino , Expresión Génica , Estudio de Asociación del Genoma Completo , Humanos , Masculino , Osteoartritis de la Cadera/cirugía , Osteoartritis de la Rodilla/cirugía , Proteómica , Proteínas Represoras , Transactivadores
16.
Nat Genet ; 50(2): 172-174, 2018 02.
Artículo en Inglés | MEDLINE | ID: mdl-29311636

RESUMEN

We have identified a variant in ADCY3 (encoding adenylate cyclase 3) associated with markedly increased risk of obesity and type 2 diabetes in the Greenlandic population. The variant disrupts a splice acceptor site, and carriers have decreased ADCY3 RNA expression. Additionally, we observe an enrichment of rare ADCY3 loss-of-function variants among individuals with type 2 diabetes in trans-ancestry cohorts. These findings provide new information on disease etiology relevant for future treatment strategies.


Asunto(s)
Adenilil Ciclasas/genética , Diabetes Mellitus Tipo 2/genética , Mutación con Pérdida de Función , Obesidad/genética , Adulto , Anciano , Anciano de 80 o más Años , Estudios de Casos y Controles , Estudios de Cohortes , Diabetes Mellitus Tipo 2/complicaciones , Diabetes Mellitus Tipo 2/epidemiología , Femenino , Frecuencia de los Genes , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Genotipo , Groenlandia/epidemiología , Humanos , Inuk/genética , Inuk/estadística & datos numéricos , Masculino , Persona de Mediana Edad , Obesidad/complicaciones , Obesidad/epidemiología , Factores de Riesgo , Adulto Joven
17.
Nat Commun ; 8: 15606, 2017 05 26.
Artículo en Inglés | MEDLINE | ID: mdl-28548082

RESUMEN

Next-generation association studies can be empowered by sequence-based imputation and by studying founder populations. Here we report ∼9.5 million variants from whole-genome sequencing (WGS) of a Cretan-isolated population, and show enrichment of rare and low-frequency variants with predicted functional consequences. We use a WGS-based imputation approach utilizing 10,422 reference haplotypes to perform genome-wide association analyses and observe 17 genome-wide significant, independent signals, including replicating evidence for association at eight novel low-frequency variant signals. Two novel cardiometabolic associations are at lead variants unique to the founder population sequences: chr16:70790626 (high-density lipoprotein levels beta -1.71 (SE 0.25), P=1.57 × 10-11, effect allele frequency (EAF) 0.006); and rs145556679 (triglycerides levels beta -1.13 (SE 0.17), P=2.53 × 10-11, EAF 0.013). Our findings add empirical support to the contribution of low-frequency variants in complex traits, demonstrate the advantage of including population-specific sequences in imputation panels and exemplify the power gains afforded by population isolates.


Asunto(s)
Frecuencia de los Genes/genética , Genoma Humano/genética , Estudio de Asociación del Genoma Completo/métodos , Población Blanca/genética , Adulto , Anciano , Femenino , Variación Genética , Grecia , Haplotipos , Humanos , Masculino , Persona de Mediana Edad , Fenotipo , Polimorfismo de Nucleótido Simple , Secuenciación Completa del Genoma/métodos
18.
Am J Hum Genet ; 100(6): 865-884, 2017 Jun 01.
Artículo en Inglés | MEDLINE | ID: mdl-28552196

RESUMEN

Deep sequence-based imputation can enhance the discovery power of genome-wide association studies by assessing previously unexplored variation across the common- and low-frequency spectra. We applied a hybrid whole-genome sequencing (WGS) and deep imputation approach to examine the broader allelic architecture of 12 anthropometric traits associated with height, body mass, and fat distribution in up to 267,616 individuals. We report 106 genome-wide significant signals that have not been previously identified, including 9 low-frequency variants pointing to functional candidates. Of the 106 signals, 6 are in genomic regions that have not been implicated with related traits before, 28 are independent signals at previously reported regions, and 72 represent previously reported signals for a different anthropometric trait. 71% of signals reside within genes and fine mapping resolves 23 signals to one or two likely causal variants. We confirm genetic overlap between human monogenic and polygenic anthropometric traits and find signal enrichment in cis expression QTLs in relevant tissues. Our results highlight the potential of WGS strategies to enhance biologically relevant discoveries across the frequency spectrum.


Asunto(s)
Antropometría , Genoma Humano , Estudio de Asociación del Genoma Completo , Sitios de Carácter Cuantitativo/genética , Análisis de Secuencia de ADN/métodos , Estatura/genética , Estudios de Cohortes , Metilación de ADN/genética , Bases de Datos Genéticas , Femenino , Variación Genética , Humanos , Lipodistrofia/genética , Masculino , Metaanálisis como Asunto , Obesidad/genética , Mapeo Físico de Cromosoma , Caracteres Sexuales , Síndrome , Reino Unido
19.
Biochim Biophys Acta ; 1824(4): 637-46, 2012 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-22310480

RESUMEN

Charged single α-helices (CSAHs) constitute a recently recognized protein structural motif. Its presence and role is characterized in only a few proteins. To explore its general features, a comprehensive study is necessary. We have set up a consensus prediction method available as a web service (at http://csahserver.chem.elte.hu) and downloadable scripts capable of predicting CSAHs from protein sequences. Using our method, we have performed a comprehensive search on the UniProt database. We found that the motif is very rare but seems abundant in proteins involved in symbiosis and RNA binding/processing. Although there are related proteins with CSAH segments, the motif shows no deep conservation in protein families. We conclude that CSAH-containing proteins, although rare, are involved in many key biological processes. Their conservation pattern and prevalence in symbiosis-associated proteins suggest that they might be subjects of relatively rapid molecular evolution and thus can contribute to the emergence of novel functions.


Asunto(s)
Proteoma/química , Análisis de Secuencia de Proteína , Secuencia de Aminoácidos , Animales , Secuencia de Consenso , Bases de Datos de Proteínas , Evolución Molecular , Humanos , Modelos Moleculares , Anotación de Secuencia Molecular , Datos de Secuencia Molecular , Isoformas de Proteínas/química , Isoformas de Proteínas/genética , Estructura Secundaria de Proteína , Proteoma/genética , Alineación de Secuencia , Programas Informáticos
20.
PLoS One ; 6(4): e18818, 2011 Apr 18.
Artículo en Inglés | MEDLINE | ID: mdl-21533121

RESUMEN

LC8 dynein light chain (DYNLL) is a eukaryotic hub protein that is thought to function as a dimerization engine. Its interacting partners are involved in a wide range of cellular functions. In its dozens of hitherto identified binding partners DYNLL binds to a linear peptide segment. The known segments define a loosely characterized binding motif: [D/S](-4)K(-3)X(-2)[T/V/I](-1)Q(0)[T/V](1)[D/E](2). The motifs are localized in disordered segments of the DYNLL-binding proteins and are often flanked by coiled coil or other potential dimerization domains. Based on a directed evolution approach, here we provide the first quantitative characterization of the binding preference of the DYNLL binding site. We displayed on M13 phage a naïve peptide library with seven fully randomized positions around a fixed, naturally conserved glutamine. The peptides were presented in a bivalent manner fused to a leucine zipper mimicking the natural dimer to dimer binding stoichiometry of DYNLL-partner complexes. The phage-selected consensus sequence V(-5)S(-4)R(-3)G(-2)T(-1)Q(0)T(1)E(2) resembles the natural one, but is extended by an additional N-terminal valine, which increases the affinity of the monomeric peptide twentyfold. Leu-zipper dimerization increases the affinity into the subnanomolar range. By comparing crystal structures of an SRGTQTE-DYNLL and a dimeric VSRGTQTE-DYNLL complex we find that the affinity enhancing valine is accommodated in a binding pocket on DYNLL. Based on the in vitro evolved sequence pattern we predict a large number of novel DYNLL binding partners in the human proteome. Among these EML3, a microtubule-binding protein involved in mitosis contains an exact match of the phage-evolved consensus and binds to DYNLL with nanomolar affinity. These results significantly widen the scope of the human interactome around DYNLL and will certainly shed more light on the biological functions and organizing role of DYNLL in the human and other eukaryotic interactomes.


Asunto(s)
Dineínas Citoplasmáticas/metabolismo , Evolución Molecular Dirigida , Proteoma , Secuencia de Aminoácidos , Secuencia de Bases , Sitios de Unión , Cristalografía por Rayos X , Dineínas Citoplasmáticas/química , Dineínas Citoplasmáticas/genética , ADN , Dimerización , Humanos , Modelos Moleculares , Datos de Secuencia Molecular
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...